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A timely combination of new theoretical ideas and observational discoveries has 
brought about significant advances in our understanding of cosmic evolution. Com- 
puter simulations have played a key role in these developments by providing the 
means to interpret astronomical data in the context of physical and cosmological 
theory. In the current paradigm, our Universe has a flat geometry, is undergoing 
accelerated expansion and is gravitationaly dominated by elementary particles that 
make up cold dark matter. Within this framework, it is possible to simulate in a 
computer the emergence of galaxies and other structures from small quantum fluc- 
tuations imprinted during an epoch of inflationary expansion shortly after the Big 
Bang. The simulations must take into account the evolution of the dark matter as 
well as the gaseous processes involved in the formation of stars and other visible 
components. Although many unresolved questions remain, a coherent picture for 
the formation of cosmic structure in now beginning to emerge. 
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1. Introduction 

The origin of structure in the Universe is a central problem in Physics. Its solution 
will not only inform our understanding of the processes by which matter became 
organized into galaxies and clusters, but it will also help uncover the identity of 
the dark matter, offer insights into events that happened in the early stages of the 
Big Bang and provide a useful check on the values of the fundamental cosmological 
parameters estimated by other means. 

Because of its non-linear character, lack of symmetry and general complexity, 
the formation of cosmic structure is best approached theoretically using numerical 
simulations. The problem is well posed because the initial conditions - small per- 
turbations in the density and velocity field of matter - are, in principle, known from 
Big Bang theory and observations of the early Universe, while the basic physical 
principles involved are understood. The behaviour of the dark matter is governed 
primarily by gravity, while the formation of the visible parts of galaxies involves gas 
dynamics and radiative processes of various kinds. Using cosmological simulations 
it is possible to follow the development of structure from primordial pertubations 
to the point where the model can be compared with observations. 

Over the past few years, there has been huge progress in quantifying observation- 
ally the properties of galaxies not only in the nearby universe, but also in the very 
distant universe. Since the clustering pattern of galaxies is rich with information 



Article submitted to Royal Society 



TgX Paper 



2 



C. S. Frenk 



about physics and cosmology, much effort is invested in mapping the distribution 
of galaxies at different epochs. Two large ongoing surveys, the US-based Sloan 
Digital Sky Survey (York et al. 2000), and the Anglo- Australian "2-degree field 
galaxy redshift survey" (2dFGRS, Colless et al. 2001), are revolutionizing our view 
of the nearby universe with order of magnitude increases in the amount of available 
data. Similarly, new data collected in the past five years or so have, for the first 
time, opened up the high redshift universe)]] to detailed statistical study (Steidel et 
al. 1996). 

The advent of large computers, particularly parallel supercomputers, together 
with the development of efficient algorithms, has enabled the accuracy and realism 
of simulations to keep pace with observational progress. With the wealth of data 
now available, simulations are essential to interpret astronomical data and to link 
them to physical and cosmological theory. 

2. Building a model 

To build a model of large-scale structure, four key ingredients need to be specified: 
(i) the content of Universe, (ii) the initial conditions, (hi) the growth mechanism, 
and (iv) the values of fundamental cosmological parameters. I now discuss each of 
these in turn. 

(a) The content of the Universe 

Densities are usually expressed in terms of the cosmological density parameter, 
= p/pcrit, where the critical density, p C rit, is the value that makes the geometry 
of the Universe flat. The main constituents of the Universe and their contribution 
to Q are listed in Table 1. 

Table 1. The content of the Universe 



Component Contribution to ft 



CMB radiation 




4.7 x 10- 


■5 


massless neutrinos 


fi* = 


3 x 10~ 5 




massive neutrinos 


0„ = 


6 x 10~ 2 


( <m„>\ 
\ lev ) 


baryons 


n b = 


0.037 ±0 


.009 


(of which stars ) 


tt s = 


(0.0023 - 


0.0041) ± 0.0004 


dark matter 




~ 0.3 




dark energy 




: 0.7 





The main contribution to the extragalactic radiation field today is the cosmic 
microwave background (CMB), the redshifted radiation left over from the Big Bang. 
These photons have been propagating freely since the epoch of "recombination" , 
approximately 300,000 years after the Big Bang. The CMB provides a direct ob- 
servational window to the conditions that prevailed in the early Universe. The Big 

f In cosmology, distances to galaxies are estimated from the redshift of their spectral lines; 
higher redshifts correspond to more distant galaxies and thus to earlier epochs. 
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Bang also produced neutrinos which today have an abundance comparable to that 
of photons. We do not yet know for certain what, if any, is the mass of the neutrino, 
but even for the largest masses that seem plausible at present, ~ O.leV, neutrinos 
make a negligible contribution to the total mass budget (although they could be as 
important as baryons). The abundance of baryons is now known with reasonable 
precision from comparing the abundance of deuterium predicted by Big Bang the- 
ory with observations of the absorption lines produced by intergalactic gas clouds 
at high redshift seen along the linc-of-sight to quasars (Tytler et al. 2000). Baryons, 
the overwhelming majority of which are not in stars today, are also dynamically 
unimportant (except, perhaps, in the cores of galaxies). 

Dark matter makes up most of the matter content of the Universe today. To 
the now firm dynamical evidence for its existence in galaxy halos, even more di- 
rect evidence has been added by the phenomenon of gravitational lensing which 
has now been detected around galaxy halos (e.g. Fischer et al. 2000, McKay et 
al. 2001, Wilson et al. 2001), in galaxy clusters (e.g. Clowe et al. 2000), and in 
the general mass field (e.g. Van Waerbeke et al. 2001 and references therein). The 
distribution of dark matter in rich clusters can be reconstructed in fair detail from 
the weak lensing of distant background galaxies in what amounts virtually to imag- 
ing the cluster dark matter. Various dynamical tests are converging on a value of 
J7(jm — 0.3, which is also consistent with independent determinations such as those 
based on the baryon fraction in clusters (White et al. 1993, Evrard 1997), and 
on the evolution in the abundance of galaxy clusters (Eke et al. 1998, Borgani et 
al. 2001). Since f^dm is much larger than fi;,, it follows that the dark matter cannot 
be made of baryons. The most popular candidate for the dark matter is a hypothet- 
ical elementary particle like those predicted by supersymmetric theories of particle 
physics. These particles are referred to generically as cold dark matter or CDM. 
(Hot dark matter is also possible, for example, if the neutrino had a mass of ~ 5 
eV. However, early cosmological simulations showed that the galaxy distribution in 
a universe dominated by hot dark matter would not resemble that observed in our 
Universe (White, Frenk and Davis 1983).) 

A recent addition to the cosmic budget is the dark energy, direct evidence for 
which was first provided by studies of type la supernovae (Riess et al. 1998, Perl- 
mutter et al. 1999)[j] These presumed 'standard candles' can now be observed at 
redshifts between 0.5 and 1 and beyond. The more distant ones are fainter than 
would be expected if the universal expansion were decelerating today, indicating 
that the expansion is, in fact, accelerating. Within the standard Friedmann cos- 
mology, there is only one agent that can produce an accelerating expansion. This is 
nowadays known as dark energy, a generalization of the cosmological constant first 
introduced by Einstein, which could, in principle, vary with time. The supernova 
evidence is consistent with the value Q\ — 0.7. Further, independent evidence for 
dark energy is provided by a recent joint analysis of CMB data (see next section) 
and the 2dFGRS (Efstathiou et al. 2002). 

Amazingly, when all the components are added together, the data are consistent 
with a flat universe: 

f The possibility that dark energy might be the dynamically dominant component had been 
anticipated by theorists from studies of the cosmic large-scale structure (see e.g. Efstathiou et 
al. 1990), and was considered in the first simulations of structure formation in cold dark matter 
universes (Davis et al. 1985). 
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n = n b + n dm + n A ~ 1 (2.1) 

(b) The initial conditions 

The idea that galaxies and other cosmic structures are the result of the slow 
amplification by the force of gravity of small primordial perturbations present in 
the mass density at early times goes back, at least, to the 1940s (Lifshitz 1946). 
However, it was only in the early 1980s that a physical mechanism capable of 
producing small perturbations was identified. This is the mechanism of inflation, 
an idea due to Guth (1981), which changed the face of modern cosmology. Inflation 
is produced by the dominant presence of a quantum scalar field which rolls slowly 
from a false to the true vacuum, maintaining its energy density approximately 
constant and causing the early Universe to expand exponentially for a brief period of 
time. Quantum fluctuations in the inflaton field are blown up to macroscopic scales 
and become established as genuine adiabatic ripples in the energy density. Simple 
models of inflation predict the general properties of the resulting fluctuation field: 
it has Gaussian distributed amplitudes and a near scale-invariant power spectrum 
(Starobinskii 1982). 

After three decades of ever more sensitive searches, evidence for the presence of 
small fluctuations in the early universe was finally obtained in 1992. Since prior to 
recombination the matter and radiation fields were coupled, fluctuations in the mass 
density are reflected in the temperature of the radiation. Temperature fluctuations 
in the CMB were discovered by the COBE satellite (Smoot et al. 1992) and are now 
being measured with ever increasing accuracy, particularly by detectors deployed 
in long-flight balloons (de Bernardis et al. 2000, Hanany et al. 2000, Leitch et 
al. 2002). The spectrum of temperature fluctuations is just what inflation predicts: it 
is scale invariant on large scales and shows a series of "Doppler" or "acoustic" peaks 
which are the result of coherent acoustic oscillations experienced by the photon- 
baryon fluid before recombination. The characteristics of these peaks depend on the 
values of the cosmological parameters. For example, the location of the first peak is 
primarily determined by the large-scale geometry of the Universe and thus by the 



value of fl. Current data imply a flat geometry, consistent with eqn. 2.1. 

The spectrum of primordial fluctuations generated, for example, by inflation 
evolves with time in a manner that depends on the content of the Universe and 
the values of the cosmological parameters. The dark matter acts as a sort of filter, 
inhibiting the growth of certain wavelengths and promoting the growth of others. 
Following the classical work of Bardeen et al. (1986), transfer functions for different 
kinds of dark matter (and different types of primordial fluctuation fields, including 
non-Gaussian cases) have been computed. In Gaussian models, the product (in 
Fourier space) of the primordial spectrum and the transfer function, together with 
the growing mode of the associated velocity field, provides the initial conditions for 
the formation of cosmic structure. 

(c) Growth mechanism 

Primordial fluctuations grow by gravitational instability: overdense fluctuations 
expand linearly, at a retarded rate relative to the Universe as a whole, until even- 
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tually they reach a maximum size and collapse non-linearly to form an equilibrium 
(or 'virialized') object whose radius is approximately half the physical size of the 
perturbation at maximum expansion. The theory of fluctuation growth is lucidly 
explained by Peebles (1980). 

Although gravitational instability is now widely accepted as the primary growth 
mechanism responsible for the formation of structure, it is only very recently that 
firm empirical evidence for this process was found. Gravitational instability causes 
inflow of material around overdense regions. From the perspective of a distant ob- 
server, this flow gives rise to a characteristic infall pattern which is, in principle, 
measurable in a galaxy redshift survey by comparing the two-point galaxy correla- 
tion function along and perpendicular to the line-of-sight. In this space, the infall 
pattern resembles a butterfly (Kaiser 1987). This pattern has been clearly seen for 
the first time in the 2dFGRS (Peacock et al. 200 1)|. 

(d) Cosmological parameters 

After decades of debate, the values of the fundamental cosmological parameters 
are finally being measured with some degree of precision. The main reason for this is 
the accurate measurement of the acoustic peaks in the CMB temperature anisotropy 
spectrum whose location, height and shape depend on the values of the cosmological 
parameters. Some parameter degeneracies exist but some of these can be broken 
using other data, for example, the distant Type la supernovae or the 2dFGRS 
(eg. Efstathiou et al. 2002). The CMB data alone do not constrain the Hubble 
constant, but there is a growing consensus from the HST key project (Freedman 
et al. 2001), and other methods, that its value, in units of 100 km s _1 Mpc, -1 
is h = 0.70 ± 0.07. In addition to h and the other parameters listed in Table 1, 
the other important number in studies of large-scale structure is the amplitude of 
primordial density fluctuations which is usually parametrized by the quantity as 
(the linearly extrapolated value of the top-hat filtered fluctuation amplitude on 
the fiducial scale of 8/i _1 Mpc). The best estimate of this quantity comes from 
the observed abundance of rich galaxy clusters which gives osSl 0,6 = 0.5, with an 
uncertainty of about 10% (Eke, Cole & Frenk 1996, Viana & Liddle 1996, Pierpaoli 
et al. 2001). 

3. Cosmological simulations 

Operationally, the problem of the cosmic large-scale structure can be divided into 
two parts: understanding the clustering evolution of the dark matter and under- 
standing the gaseous and radiative processes that lead to the formation of galaxies. 
Specialized simulation techniques have been developed to tackle both aspects of 
the problem. The evolution of the dark matter is most often calculated using N- 
body techniques, implemented through a variety of efficient algorithms, such as 
P 3 M (Particle-particle/particle-mesh; Efstathiou et al. 1985), AP 3 M (the adap- 
tive mesh version of P 3 M; Couchman et al. 1995) and hierarchical trees (Barnes 
& Hut 1986, Springel et al. 2001, Stadel 2000). Gaseous and radiative processes 

t Strictly speaking the 'butterfly' pattern does not prove the existence of infall since the conti- 
nuity equation would ensure a similar pattern even if velocities were induced by non-gravitational 
processes. However, it can be shown that such velocities, if present, would rapidly decay. 
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are followed by combining a hydrodynamics code with an N-body code. Numerical 
hydrodynamic techniques used in cosmology include Eulerian methods (Cen 1992), 
Lagrangian codes based on Smooth Particle Hydrodynamics (SPH) (Gingold & 
Monaghan 1977), and hybrid codes (e.g. Gnedin 1995, Pen 1998). These techniques 
have different strengths and weaknesses, but they all give similar results in the 
simplest cosmological problems where a detailed comparison has been performed 
(Frenk et al. 1999). 

There has been a rapid growth in the size and power of cosmological simulations 
in the two and a half decades since this technique was introduced into the subject 
by Peebles (1970). One way to measure this growth is by the number of particles 
employed in the simulations. The size of the largest simulations has grown expo- 
nentially, in a manner reminiscent of the well-known "Moore's law" that describes 
the increase in cpu speed with time, except that the advent of massively parallel 
supercomputers led to a sudden order-of-magnitude jump in size towards the end 
of the past decade. The largest simulations carried out to date are the 1-billion 
particle "Hubble volume," N-body simulations performed by the Virgo consortium, 
an international collaboration of reseachers in the UK, Germany and Canada. 



(a) Large-scale structure 

Figure |l| illustrates the spatial distribution of dark matter at the present day, 
in a series of simulations covering a large range of scales. Each panel is a thin 
slice of the cubical simulation volume and shows the slightly smoothed density field 
defined by the dark matter particles. In all cases, the simulations pertain to the 
"ACDM" cosmology, a flat cold dark matter model in which £!d m = 0.3, £1a = 0.7 
and h = 0.7. The top-left panel illustrates the Hubble volume simulation: on these 
large scales, the distribution is very smooth. To reveal more interesting structure, 
the top right panel displays the dark matter distribution in a slice from a volume 
approximately 2000 times smaller. At this resolution, the characteristic filamentary 
appearance of the dark matter distribution is clearly visible. In the bottom-right 
panel, we zoom again, this time by a factor of 5.7 in volume. We can now see 
individual galactic-size halos which preferentially occur along the filaments, at the 
intersection of which large halos form that will host galaxy clusters. Finally, the 
bottom-left panel zooms into an individual galactic-size halo. This shows a large 
number of small substructures that survive the collapse of the halo and make up 
about 10% of the total mass (Klypin et al. 1999, Moore et al. 1999) 

For simulations like the ones illustrated in Figure |l|, it is possible to characterize 
the statistical properties of the dark matter distribution with very high accuracy. 
For example, Figure || shows the 2-point correlation function, £(r), of the dark 
matter (a measure of its clustering strength) in the simulation depicted in the top- 
right of Figure [l] (Jenkins et al. 1998). The statistical error bars in this estimate 
are actually smaller than the thickness of the line. Similarly, higher order clustering 
statistics, topological measures, the mass function and clustering of dark matter 
halos and the time evolution of these quantities can all be determined very precisely 
from these simulations (e.g. Jenkins et al. 2001, Evrard et al. 2002). In a sense, the 
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Figure 1. Slices through 4 different simulations of the dark matter in the 'ACDM" cosmol- 
ogy. Denoting the number of particles in each simulation by N, the length of the simulation 
cube by L, the thickness of the slice by t, and the particle mass by m p , the characteristics 
of each panel are as follows. Top-left (the Hubble volume simulation, Evrard et al. 2002): 
iV = 10 9 , L = 3000 ft" 1 Mpc, t = 30 ft" 1 Mpc, m p = 2.2 x 10 12 ft -1 M s . Top-right (Jenkins 
et al. 1998): N = 16.8 x 10 6 , L = 250 ft" 1 Mpc, t = 25 ft" 1 Mpc, m p = 6.9 x 10 10 h~ x M®. 
Bottom-right (Jenkins et al. 1998): N = 16.8 x 10 6 , L = 140 ft" 1 Mpc, t = 14 ft" 1 Mpc, 
m p = 1.4 x 10 10 ft _1 M©. Bottom-left (Navarro et al. 2002): N = 7x 10 6 , L = 0.5 ft" 1 Mpc, 
t = 1 ft" 1 Mpc, m p = 6.5 x 10 5 ft _1 M . 

problem of the distribution of dark matter in the ACDM model can be regarded as 
largely solved^. 

f However, the innermost structure of halos like those in the bottom-left of Figure |l] is still a 
matter of controversy. 
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Figure 2. Two-point correlation functions. The dotted line shows the dark matter £d m (r) 
(Jenkins et al. 1998). The solid line shows the galaxy predictions of Benson et al. (2000), 
with Poisson errors indicated by the dashed lines. The points with errorbars show the 
observed galaxy £, ga i(r) (Baugh 1996). The galaxy data are discussed in §3(b). (Adapted 
from Benson et al. (2001a). 

In contrast to the clustering of the dark matter, the process of galaxy forma- 
tion is still poorly understood. How then can dark matter simulations like those of 
Figure |l| be compared with observational data which, for the most part, refer to 
galaxies? On large scales a very important simplification applies: for Gaussian the- 
ories like CDM, it can be shown that if galaxy formation is a local process, that is, 
if it depends only upon local physical conditions (density, temperature, etc), then, 
on scales much larger than that associated with individual galaxies, the galaxies 
must trace the mass, i.e. on sufficiently large scales, £ sa z(0 oc ^ m (r) (Coles 1993). 
It suffices therefore to identify a random subset of the dark matter particles in the 
simulation to obtain an accurate prediction for the properties of galaxy clustering 
on large scales. This idea (complemented on small scales by an empirical prescrip- 
tion in the manner described by Cole et al. 1998) has been used to construct the 
mock versions of a region of the APM galaxy survey and of a slice of the 2dFGRS 
displayed in Figures [3J and || which also show the real data for comparison in each 
case. By eye at least, it is very difficult to distinguish the mocks from the real data. 

A quantitative comparison between simulations and the real world is carried out 
in Figure [|. The symbols show the estimate of the power spectrum in the 2dFGRS 
survey (Percival et al. 2001). This is the raw power spectrum convolved with the 
survey window function and can be compared directly with the line showing the 
theoretical prediction obtained from the mock catalogues which have exactly the 
same window function. The agreement between the data and the ACDM model is 
remarkably good. 
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Figure 3. The region of the APM projected galaxy survey from which the 2dFGRS is 
drawn. Only galaxies brighter than mi,, = 19.35 are plotted. The top panel is the real 
data and the other two panels are mock catalogues constructed from the Hubble volume 
simulations. 



(b) Galaxy formation 

Understanding galaxy formation is a much more difficult problem than under- 
standing the evolution of the dark matter distribution. In the CDM theory, galaxies 
form when gas, initially well mixed with the dark matter, cools and condenses into 
emerging dark matter halos. In addition to gravity, a non-exhaustive list of the 
processes that now need to be taken into account includes: the shock heating and 
cooling of gas into dark halos, the formation of stars from cold gas and the evolution 
of the resulting stellar population, the feedback processes generated by the ejection 
of mass and energy from evolving stars, the production and mixing of heavy ele- 
ments, the extinction and reradiation of stellar light by dust particles, the formation 
of black holes at the centres of galaxies and the influence of the associated quasar 
emission. These processes span an enormous range of length and mass scales. For 
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Figure 4. A 1° thick slice through the 2dF galaxy redshift survey. The radial coordinate 
is redshift and the angular coordinate is right ascension. The top-left panel is the real 
data and the other two panels are mock catalogues constructed from the Hubble volume 
simulations. 

example, the parsec scale relevant to star formation is a factor of 10 8 smaller than 
the scale of a galaxy supercluster. 

The best that can be done with current computing techniques is to model the 
evolution of dark matter and gas in a cosmological volume with resolution compa- 
rable to a single galaxy. Subgalactic scales must then then be regarded as "sub- 
grid" scales and followed by means of phenomenological models based either on 
our current physical understanding or on observations. In the approach known as 
"semi- analytic" modelling (White & Frenk 1991), even the gas dynamics is treated 
phenomenologically using a simple, spherically symmetric model to describe the 
accretion and cooling of gas into dark matter halos. It turns out that this simple 
model works suprisingly well as judged by the good agreement with results of full 
N-body/gas-dynamical simulations (Benson et al. 2001b, Helly et al. 2002, Yoshida 
et al. 2002). 
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Figure 5. The power spectrum of the 2dFGRS (symbols) compared with the power spec- 
trum predicted in the ACDM model (line). Both power spectra are convolved with the 
2dFGRS window function. The model predictions come from dark matter simulations and 
assume that, on large scales, the distribution of galaxies traces the distribution of mass. 
(Adapted from Percival et al. 2001). 

The main difficulty encountered in cosmological gas dynamical simulations arises 
from the need to suppress a cooling instability present in hierarchical clustering 
models like CDM. The building blocks of galaxies are small clumps that condense 
at early times. The gas that cools within them has very high density, reflecting 
the mean density of the Universe at that epoch. Since the cooling rate is propor- 
tional to the square of the gas density, in the absence of heat sources, most of the 
gas would cool in the highest levels of the mass hierarchy leaving no gas to power 
star formation today or even to provide the hot, X-ray emitting plasma detected 
in galaxy clusters. Known heat sources are photoionisation by early generations of 
stars and quasars and the injection of energy from supernovae and active galactic 
nuclei. These processes, which undoubtedly happened in our Universe, belong to 
the realm of subgrid physics which cosmological simulations cannot resolve. Dif- 
ferent treatments of this "feedback" result in different amounts of cool gas and 
can lead to very different predictions for the properties of the galaxy population. 
This is a fundamental problem that afflicts cosmological simulations even when 
they are complemented by the inclusion of semi-analytic techniques. In this case, 
the resolution of the calculation can be extended to arbitrarily small mass halos, 
perhaps allowing a more realistic treatment of feedback. Although they are less 
general than full gasdynamical simulations, simulations in which the evolution of 
gas is treated semi-analytically make experimentation with different prescriptions 
relatively simple and efficient (Kauffmann White & Guiderdoni 1993, Somerville & 
Primack 1999, Cole et al 2000) 

The outcome of an N-body dark matter simulation in a ACDM universe in which 
the visible properties of the galaxies have been calculated using the semi-analytic 
model of Cole et al. (2000) is illustrated in Fig. || (Benson et al. 2001a). Galaxies 
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Figure 6. A slice 10 h 1 Mpc thick of a simulation of a cubic region of side 141 h' 1 Mpc 
in the ACDM cosmology. The grey scale shows a slightly smoothed representation of the 
dark matter in the N-body simulation. The coloured dots show galaxies; the size of the 
dots is proportional to the B-band luminosity of the galaxy and the colour represents the 
B-V colour as given on the scale on the top. The top panel corresponds to redshift z — 
and the bottom panel to z = 3. (Adapted from Benson et al. 2001a). 



form mostly along the filaments delineated by the dark matter. Red galaxies pre- 
dominate in the most massive dark matter halos, just as observed in real galaxy 
clusters. This segregation is a natural outcome of hierarchical clustering from CDM 
initial conditions. It reflects the fact that the progenitors of rich clusters form sub- 
stantially earlier than a typical dark matter halo of the same mass. Fig. shows 
the galaxy luminosity function which describes the abundance of galaxies of differ- 
ent luminosities. The theoretical predictions, shown by the line, agree remarkably 
well with the observations but this should not be regarded as a spectacular success 
of the theory because the free parameters in the semi-analytic star formation and 
feedback model have been tuned to achieve as good a match as possible to this 
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Figure 7. The galaxy luminosity function. The symbols show the number of galaxies per 
unit volume and per unit magnitude measured in various surveys, as a function of galaxy 
magnitude (open circles: Zucca et al. 1997; open squares: Loveday et al. 1992; thick error 
bars: Norberg et al. 2001b). The solid line shows the predictions of the semi-analytic model 
of Cole et al. (2000). 



specific observational dataset. In particular, the feedback model has been tuned to 
produce a relatively flat function at the faint end. 

Having fixed the model parameters by reference to a small subset of the data 
such as the galaxy luminosity function, we can ask whether the same model accounts 
for other basic observational data. The galaxy autocorrelation function, £ 90 j(r), in 
the simulations is plotted in Fig. |2| above. On large scales, it follows ^ m (r) quite 
closely, but on small scales it dips below the mass autocorrelation function. This 
small scale "antibias" has also been seen in N-body/gasdynamical simulations of 
the ACDM cosmology (Pearce et al. 1999, 2001, Dave et al. 1999), and in dark 
matter simulations that resolve individual galactic halos (Klypin et al. 1999). The 
galaxy autocorrelation function in the simulations of Benson et al. (2000) agrees 
remarkably well with the observational data (see also Kauffmann et al. 1999a). 
This is a genuine success of the theory because no model parameters have been 
adjusted in this comparison. The differences between the small-scale clustering of 
galaxies and dark matter result from the interplay between the clustering of dark 
matter halos and the occupation statistics of galaxies in halos which, in turn, are 
determined by the physics of galaxy formation. This conclusion, discussed in detail 
by Benson et al. (2000), has led to the development of an analytic formulation 
known as the "halo model" (e.g. Seljak 2000, Peacock & Smith 2000, Berlind & 
Weinberg 2002). 

Another genuine prediction of the model is the dependence of the strength of 
clustering on the luminosity of different subsamples. It can be seen in Fig. | that 
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Figure 8. The correlation length as a function of the luminosity of different galaxy sub- 
samples. The correlation length is defined as the pair separation for which £(r) = 1. The 
symbols show the results from the 2dFGRS and the line the predictions of the simulations 
of Benson et al. (2000). (Adapted from Norberg et al. 2001a). 

the brightest galaxies are concentrated in the most massive clusters, leading one 
to suspect that their autocorrelation function must be stronger than average. This 
is indeed the illustrated in Fig. |^ which compares the variation of the 

clustering length (defined as the pair separation for which £(r) = 1) of galaxy 
samples of different intrinsic luminosity in the simulations of Benson et al. (2001a) 
with the observational data obtained from the 2dFGRS by Norberg et al. (2001a). 
The agreement between theory and observations is remarkable considering that 
there are no adjustble parameters in this comparison. The reason for the strong 
clustering of bright galaxies is related to the colour-density relation seen in Fig. |f 
the brightest galaxies form in the highest peaks of the density distribution which, 
in initially Gaussian fields, are more strongly clustered than average peaks which 
produce less extreme galaxies. 

The patch of model universe illustrated in the top panel of Fig. [| is shown 
at the earlier epoch corresponding to redshift z = 3 (when the universe was only 
about 20% of its current age) in the bottom panel of this figure. The galaxies are 
now blue, reflecting the colour of their younger stellar population. There are fewer 
galaxies in this plot than in the z = slice. In fact, this is the epoch when the 
first substantial population of bright galaxies formed in the simulation. As Baugh 
et al. (1998) argued, the properties of these model galaxies resemble those of the 
"Lyman-break" galaxies discovered by Steidel et al. (1996), even though different 
models make somewhat different predictions for their exact properties (Somerville 
et al. 2001). Most models, however, predict that the brightest galaxies at z = 3 
should be strongly clustered (Kauffmann et al. 1999b) and, indeed, the models 
of Baugh et al. (1998) correctly anticipated that the Lyman-break galaxies would 
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have a clustering length comparable to that of bright galaxies today (Adelberger 
et al. 1998). This too should be regarded as a significant success of this kind of 
modelling in the ACDM cosmology. As Fig. ^ shows, in contrast to the galaxies, 
the dark matter is much more weakly clustered at z = 3 than at z = 0, indicating 
that galaxies were strongly biased at birth. 

4. Conclusions 

Unlike most computational problems in many areas of science, the cosmological 
problem is blessed with known, well-specified initial conditions. Within a general 
class of models, it is possible to calculate the properties of primordial perturbations 
in the cosmic energy density generated by quantum processes during an early in- 
flationary epoch. In a wide family of inflationary models, these perturbations are 
adiabatic, scale-invariant and have Gaussian-distributed Fourier amplitudes. The 
model also requires an assumption about the nature of the dark matter and the 
possibilities have now been narrowed down to non-baryonic candidates of which 
cold dark matter particles seem the most promising. An empirical test of the initial 
conditions for the formation of structure predicted by the model is provided by 
the cosmic microwave background radiation. The tiny temperature fluctuations it 
exhibits have exactly the properties expected in the model. Furthermore, the CMB 
data can be used to fix some of the key model parameters such as and fib, while 
these data, combined with other recent datasets such as the 2dFGRS, allow the 
determination of many of the remaining parameters such as f2 m , f^A and h. It this 
specificity of the cosmological problem that has turned simulations into the primary 
tool for connecting cosmological theory to astronomical observations. 

In addition to well-specified initial conditions, the cosmological dark matter 
problem has the advantage that the only physical interaction that is important is 
gravity. The problem can thus be posed as a gravitational N-body problem and 
approached using the many sophisticated techniques that have been developed over 
the past two decades to tackle this problem. Although on small scales there remain 
a number of unresolved issues, it is fair to say that on scales larger than a few mega- 
parsecs, the distribution of dark matter in CDM models is essentially understood. 
The inner structure of dark matter halos, on the other hand, is still a matter of de- 
bate and the mass function of dark matter halos has only been reliably established 
by simulations down to masses of order 10 11 M Q . Resolving these outstanding issues 
is certainly within reach, but this will require carefully designed simulations and 
large amounts of computing power. 

The frontier of the subject at present lies in simulations of the formation, evolu- 
tion and structure of galaxies. This problem requires first of all a treatment of gas 
dynamics in a cosmological context and a number of techniques, relying on direct 
simulations or on semi-analytical approximations, are being explored. There are 
quite a few different approaches to cosmological gasdynamics, but it is reassuring 
that they all give similar results in the simplest relevant problem, the evolution 
of non-radiative gas during the formation of a galaxy cluster. No detailed compar- 
isons exist yet for the more complicated case in which the gas is allowed to cool, 
but at least one of the gasdynamic simulation techniques, SPH, gives quite similar 
results to a simple semi-analytic approach. Realistic models of galaxy formation, 
however, will require much more than a correct treatment of cooling gas. Such mod- 
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els will necessarily have to include a plethora of astrophysical phenomena such as 
star formation, feedback, metal enrichment, etc. The huge disparity between the 
submegaparsec scales on which these processes operate and the gigaparsec scale of 
the large-scale structure makes it impossible to contemplate a comprehensive ab 
initio calculation. The way forward is clearly through a hybrid approach combining 
direct simulation of processes operating on a limited range of scales with a phc- 
nomenological treatment of the others. There is currently a great deal of activity 
in the phenomenology of galaxy formation. 

In spite of the uncertainties that remain, all the indications are that our Universe 
is well described by a model in which 

(i) the overall geometry is flat; 

(ii) the dominant dynamical components are cold dark matter (~ 30%) and dark 
energy (~ 70%) with baryons playing very much a supporting role (~ 4%); 

(iii) the initial conditions are quantum fluctuations in the primordial energy den- 
sity generated during inflation and 

(iv) structure has grown primarily as a result of the gravitational instability ex- 
perienced by mass fluctuations in an expanding universe. 

A skeptic is entitled to feel that the current paradigm is odd, to say the least. 
Not only is there a need to invoke vast amounts of as yet undetected non-baryonic 
cold dark matter, but there is also the need to account for the dominant presence 
of a dark energy whose very existence is a mystery within conventional models of 
fundamental physics. Odd as it may seem, however, this model accounts remarkably 
well for a large and diverse collection of empirical facts that span 13 billion years 
of evolution. 
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